Multidimensional access methods
Volker Gaede, Oliver Günther

June 1998 ACM Computing Surveys (CSUR), volume 30 issue 2 

Additional Information: full citation, abstract, references, citings, index terms



Full text available: pdf(1.05 MB)



Search operations in databases require special support at the physical level. This is true for 
conventional databases as well as spatial databases, where typical search operations 
include the point query (find all objects that contain a given search point) and the region 
query (find all objects that overlap a given search region). More than ten years of spatial 
database research have resulted in a great variety of multidimensional access methods to 
support ... 

Keywords: data structures, multidimensional access methods 



2 External memory algorithms and data structures: dealing with massive data 
Jeffrey Scott Vitter 



June 2001 ACM Computing Surveys (CSUR), volume 33 issue 2 

Additional Information: full citation, abstract, references, citings, index terms



Full text available: pdf(828.46 KB)



Data sets in large applications are often too massive to fit completely inside the computer's
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit 
locality in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 



Ext3cow: a time-shifting file system for regulatory compliance 
Zachary Peterson, Randal Burns 

May 2005 ACM Transactions on Storage (TOS), volume 1 issue 2

Full text available: pdf(443.01 KB) Additional Information: full citation, abstract, references, index terms
The ext3cow file system, built on the popular ext3 file system, provides an open-source file
versioning and snapshot platform for compliance with the versioning and auditability
requirements of recent electronic record retention legislation. Ext3cow provides a time- 
shifting interface that permits a real-time and continuous view of data in the past. Time- 
shifting does not pollute the file system namespace nor require snapshots to be mounted as 
a separate file system. Further, ext3cow is i ... 

Keywords: Versioning file systems, copy-on-write 



Fast and secure distributed read-only file system 
Kevin Fu, M. Frans Kaashoek, David Mazieres 

February 2002 ACM Transactions on Computer Systems (TOCS), Volume 20 issue 1 

Full text available: pdf(317.54 KB) Additional Information: full citation, abstract, references, index terms

Internet users increasingly rely on publicly available data for everything from software 
installation to investment decisions. Unfortunately, the vast majority of public content on 
the Internet comes with no integrity or authenticity guarantees. This paper presents the 
self-certifying read-only file system, a content distribution system providing secure, 
scalable access to public, read-only data. The read-only file system makes the security of
published content independent from that of the distri ... 

Keywords: File systems, read-only, security 



5 Deciding when to forget in the Elephant file system 

Douglas S. Santry, Michael J. Feeley, Norman C. Hutchinson, Alistair C. Veitch, Ross W. 
Carton, Jacob Ofir 

December 1999 ACM SIGOPS Operating Systems Review, Proceedings of the seventeenth ACM
symposium on Operating systems principles, volume 33 issue 5

Full text available: pdf(1.61 MB) Additional Information: full citation, abstract, references, citings, index terms

Modern file systems associate the deletion of a file with the immediate release of storage, 
and file writes with the irrevocable change of file contents. We argue that this behavior is a 
relic of the past, when disk storage was a scarce resource. Today, large cheap disks make it 
possible for the file system to protect valuable data from accidental delete or overwrite. This
paper describes the design, implementation, and performance of the Elephant file system, 
which automatically retains all impo ... 

6 The evolution of Coda
M. Satyanarayanan 

May 2002 ACM Transactions on Computer Systems (TOCS), Volume 20 issue 2 

Full text available: pdf(441.35 KB) Additional Information: full citation, abstract, references, citings, index terms

Failure-resilient, scalable, and secure read-write access to shared information by mobile 
and static users over wireless and wired networks is a fundamental computing challenge. In 
this article, we describe how the Coda file system has evolved to meet this challenge 
through the development of mechanisms for server replication, disconnected operation, 
adaptive use of weak connectivity, isolation-only transactions, translucent caching, and 
opportunistic exploitation of hardware surrogates. For eac ... 

Keywords: Adaptation, Linux, UNIX, Windows, caching, conflict resolution, continuous data 
access, data staging, disaster recovery, disconnected operation, failure, high availability, 
hoarding, intermittent networks, isolation-only transactions, low-bandwidth networks, 
mobile computing, optimistic replica control, server replication, translucent cache
management, weakly connected operation 



7 Algorithms and data structures for flash memories
Eran Gal, Sivan Toledo 

June 2005 ACM Computing Surveys (CSUR), volume 37 issue 2 

Full text available: pdf(343.39 KB) Additional Information: full citation, abstract, references, index terms

Flash memory is a type of electrically-erasable programmable read-only memory 
(EEPROM). Because flash memories are nonvolatile and relatively dense, they are now used 
to store files and other persistent objects in handheld computers, mobile phones, digital 
cameras, portable music players, and many other computer systems in which magnetic 
disks are inappropriate. Flash, like earlier EEPROM devices, suffers from two limitations. 
First, bits can only be cleared by erasing a large block of memory. S ... 

Keywords: EEPROM memory, Flash memory, wear leveling 



Improving storage system availability with D-GRAID

Muthian Sivathanu, Vijayan Prabhakaran, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau

May 2005 ACM Transactions on Storage (TOS), Volume 1 issue 2

Full text available: pdf(700.30 KB) Additional Information: full citation, abstract, references, index terms

We present the design, implementation, and evaluation of D-GRAID, a gracefully degrading 
and quickly recovering RAID storage array. D-GRAID ensures that most files within the file 
system remain available even when an unexpectedly high number of faults occur. D-GRAID 
achieves high availability through aggressive replication of semantically critical data, and 
fault-isolated placement of logically related data. D-GRAID also recovers from failures 
quickly, restoring only live file system data to a h ... 

Keywords: Block-based storage, Disk array, RAID, fault isolation, file systems, smart disks 



9 The Grid File: An Adaptable, Symmetric Multikey File Structure 
J. Nievergelt, Hans Hinterberger, Kenneth C. Sevcik

March 1984 ACM Transactions on Database Systems (TODS), volume 9 issue 1

Full text available: pdf(2.35 MB) Additional Information: full citation, abstract, references, citings, index terms

Traditional file structures that provide multikey access to records, for example, inverted 
files, are extensions of file structures originally designed for single-key access. They
manifest various deficiencies in particular for multikey access to highly dynamic files. We 
study the dynamic aspects of file structures that treat all keys symmetrically, that is, file 
structures which avoid the distinction between primary and secondary keys. We start from 
a bitmap approach and treat the problem ... 

10 Hancock: A language for analyzing transactional data streams
Corinna Cortes, Kathleen Fisher, Daryl Pregibon, Anne Rogers, Frederick Smith 

March 2004 ACM Transactions on Programming Languages and Systems (TOPLAS), Volume 26 Issue 2

Full text available: pdf(217.55 KB) Additional Information: full citation, abstract, references, index terms

Massive transaction streams present a number of opportunities for data mining techniques. 
The transactions in such streams might represent calls on a telephone network, commercial 
credit card purchases, stock market trades, or HTTP requests to a web server. While 
historically such data have been collected for billing or security purposes, they are now
being used to discover how the transactors, for example, credit-card numbers or IP 
addresses, use the associated services. Over the past 5 years, w ... 

Keywords: Domain-specific languages, data mining, statistical models 



11 The Quadtree and Related Hierarchical Data Structures 
Hanan Samet 

June 1984 ACM Computing Surveys (CSUR), volume 16 issue 2 

Full text available: pdf(4.87 MB) Additional Information: full citation, references, citings, index terms



12 Systems and applications: A hybrid approach to optimistic file system directory tree
synchronization

Tancred Lindholm, Jaakko Kangasharju, Sasu Tarkoma 

June 2005 Proceedings of the 4th ACM international workshop on Data engineering for 
wireless and mobile access 

Full text available: pdf(220.29 KB) Additional Information: full citation, abstract, references, index terms

There are two main approaches to optimistic file system synchronization: distributed file 
systems and file synchronizers. The former type is characterized by a log-based approach 
that depends on access to file system internals, the latter by a state-based approach that 
utilizes the standard file system interface, which limits the efficiency of change 
detection. We propose a hybrid approach that 1) defines a minor extension to the semantics 
of the file system interface that enables efficient state ... 

Keywords: XML, directory tree, optimistic synchronization, reconciliation, stackable file 
system, state-based 



13 OceanStore: an architecture for global-scale persistent storage

John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels, 
Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon, Chris Wells, Ben Zhao 
November 2000 Proceedings of the ninth international conference on Architectural
support for programming languages and operating systems, volume 28, 34 issue 5, 5

Full text available: pdf(166.53 KB) Additional Information: full citation, abstract, references, citings, index terms

OceanStore is a utility infrastructure designed to span the globe and provide continuous 
access to persistent information. Since this infrastructure is comprised of untrusted servers, 
data is protected through redundancy and cryptographic techniques. To improve 
performance, data is allowed to be cached anywhere, anytime. Additionally, monitoring of 
usage patterns allows adaptation to regional outages and denial of service attacks; 
monitoring also enhances performance through pro-active movement ... 

14 The string B-tree: a new data structure for string search in external memory and its
applications

Paolo Ferragina, Roberto Grossi 

March 1999 Journal of the ACM (JACM), volume 46 issue 2

Full text available: pdf(363.37 KB) Additional Information: full citation, abstract, references, citings, index terms

We introduce a new text-indexing data structure, the String B-Tree, that can be seen as a 
link between some traditional external-memory and string-matching data structures. In a 
short phrase, it is a combination of B-trees and Patricia tries for internal-node indices that is
made more effective by adding extra pointers to speed up search and update operations. 
Consequently, the String B-Tree overcomes the theoretical limitations of inverted files, B- 
trees, prefix B-trees, s ... 

Keywords: B-tree, Patricia trie, external-memory data structure, prefix and range search, 
string searching and sorting, suffix array, suffix tree, text index 



15 Some experiments in directory organization - a simulation study
Allen Reiter 

March 1976 Proceedings of the 1976 ACM SIGMETRICS conference on Computer 
performance modeling measurement and evaluation 

Full text available: pdf(707.49 KB) Additional Information: full citation, abstract, references, citings, index terms

Using a simulation model, experiments were conducted on various directory organization 
schemes and their performance implications. In particular we tested the effects of a 
multiprogrammed environment on system throughput for retrieval operations. Analysis of 
the results shows that different factors are relevant to performance for the various systems, 
and that under some circumstances ISAM and hash-coding may lose the advantages they 
possess over B-trees in a stand-alone environment when mul ... 

Keywords: B-tree, Directories, Hash-coding, Index sequential, Performance analysis, 
Throughput prediction 



16 OceanStore: an architecture for global-scale persistent storage

John Kubiatowicz, David Bindel, Yan Chen, Steven Czerwinski, Patrick Eaton, Dennis Geels,
Ramakrishna Gummadi, Sean Rhea, Hakim Weatherspoon, Westley Weimer, Chris Wells, Ben
Zhao

November 2000 ACM SIGPLAN Notices, volume 35 issue 11

Full text available: pdf(1.47 MB) Additional Information: full citation, abstract, references, index terms

OceanStore is a utility infrastructure designed to span the globe and provide continuous 
access to persistent information. Since this infrastructure is comprised of untrusted servers, 
data is protected through redundancy and cryptographic techniques. To improve 
performance, data is allowed to be cached anywhere, anytime. Additionally, monitoring of 
usage patterns allows adaptation to regional outages and denial of service attacks; 
monitoring also enhances performance through pro-active movement ... 



SaveMe: a system for archiving electronic documents using messaging groupware
Stefan Berchtold, Alexandros Biliris, Euthimios Panagos

March 1999 ACM SIGSOFT Software Engineering Notes, Proceedings of the international joint
conference on Work activities coordination and collaboration, Volume 24 Issue 2

Full text available: pdf(1.47 MB) Additional Information: full citation, abstract, references, index terms

Today, organizations deal with an ever-increasing number of documents that have to be 
archived because they are either related to their core business (e.g., product designs) or 
needed to meet corporate or legal retention requirements (e.g., voucher). In this paper, we 
present the architecture and prototype implementation of SaveMe, a document archival 
system that is based on network-centric groupware such as Internet standards-based 
messaging systems. In SaveMe, the actions of archiving, retriev ... 




Keywords: Internet, archiving, groupware, messaging 
18 Improving the granularity of access control for Windows 2000

Michael M. Swift, Anne Hopkins, Peter Brundrett, Cliff Van Dyke, Praerit Garg, Shannon Chan, 
Mario Goertzel, Gregory Jensenworth 

November 2002 ACM Transactions on Information and System Security (TISSEC), volume 
5 Issue 4 

Full text available: pdf(447.78 KB) Additional Information: full citation, abstract, references, citings, index terms, review

This article presents the mechanisms in Windows 2000 that enable fine-grained and
centrally managed access control for both operating system components and applications. 
These features were added during the transition from Windows NT 4.0 to support the Active 
Directory, a new feature in Windows 2000, and to protect computers connected to the 
Internet. While the access control mechanisms in Windows NT are suitable for file systems 
and applications with simple requirements, they fall short of the ... 

Keywords: Access control lists, Microsoft Windows 2000, Windows NT, active directory 



19 Memory management during run generation in external sorting 
Per-Åke Larson, Goetz Graefe

June 1998 ACM SIGMOD Record, Proceedings of the 1998 ACM SIGMOD international
conference on Management of data, volume 27 issue 2

Full text available: pdf(2.08 MB) Additional Information: full citation, abstract, references, citings, index terms

If replacement selection is used in an external mergesort to generate initial runs, individual 
records are deleted and inserted in the sort operation's workspace. Variable-length records 
introduce the need for possibly complex memory management and extra copying of 
records. As a result, few systems employ replacement selection, even though it produces 
longer runs than commonly used algorithms. We experimentally compared several 
algorithms and variants for managing this workspace. We found t ... 

Keywords: last-run optimization, memory management, merge sort, replacement 
selection, run formation, sorting, variable length records 



20 Embedded systems: applications, solutions and techniques (EMBS): An efficient
management scheme for large-scale flash-memory storage systems
Li-Pin Chang, Tei-Wei Kuo 

March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: pdf(299.99 KB) Additional Information: full citation, abstract, references, citings, index terms

Flash memory is among the top choices for storage media in ubiquitous computing. With a 
strong demand of high-capacity storage devices, the usages of flash memory quickly grow 
beyond their original designs. The very distinct characteristics of flash memory introduce 
serious challenges to engineers in resolving the quick degradation of system performance 
and the huge demand of main-memory space for flash-memory management when high- 
capacity flash memory is considered. Although some brute-force so ... 

Keywords: consumer electronics, embedded systems, flash memory, memory 
management, portable devices, storage systems 
October 1998 Proceedings of the third ACM workshop on Role-based access control

Full text available: pdf(4.62 MB) Additional Information: full citation, references, citings, index terms



Capturing dynamic memory reference behavior with adaptive cache topology
Jih-Kwon Peir, Yongjoon Lee, Windsor W. Hsu 

October 1998 Proceedings of the eighth international conference on Architectural
support for programming languages and operating systems, volume 33, 32 issue 11, 5

Full text available: pdf(1.50 MB) Additional Information: full citation, abstract, references, citings, index terms

Memory references exhibit locality and are therefore not uniformly distributed across the 
sets of a cache. This skew reduces the effectiveness of a cache because it results in the 
caching of a considerable number of less-recently-used lines which are less likely to be re- 
referenced before they are replaced. In this paper, we describe a technique that 
dynamically identifies these less-recently-used lines and effectively utilizes the cache 
frames they occupy to more accurately approximate the glob ... 

History-based access control for mobile code
Guy Edjlali, Anurag Acharya, Vipin Chaudhary 

November 1998 Proceedings of the 5th ACM conference on Computer and 
communications security 

Full text available: pdf(1.33 MB) Additional Information: full citation, references, citings, index terms



4 Document archiving, replication and migration container for mobile Web users 
Peter Stanski, Stephen Giles, Arkady Zaslavsky 

February 1998 Proceedings of the 1998 ACM symposium on Applied Computing 

Full text available: pdf(809.84 KB) Additional Information: full citation, references, index terms
5 A user interface using fingerprint recognition: holding commands and data objects on
fingers

Atsushi Sugiura, Yoshiyuki Koseki 

November 1998 Proceedings of the 11th annual ACM symposium on User interface 
software and technology 

Full text available: pdf(226.02 KB) Additional Information: full citation, references, citings, index terms



Keywords: fingerprint recognition, input devices, multimodal user interfaces,
multi-computer user interfaces



Session directories and scalable Internet multicast address allocation 
Mark Handley 

October 1998 ACM SIGCOMM Computer Communication Review , Proceedings of the 
ACM SIGCOMM '98 conference on Applications, technologies, 
architectures, and protocols for computer communication, volume 28 issue 4 

Full text available: pdf(1.63 MB) Additional Information: full citation, abstract, references, citings, index terms

A multicast session directory is a mechanism by which users can discover the existence of 
multicast sessions. In the Mbone, session announcements have also served as multicast 
address reservations - a dual purpose that is efficient, but which may cause some
side effects as session directories scale. In this paper we examine the scaling of multicast
address allocation when it is performed by such a multicast session directory. Despite our 
best efforts to make such an approach scale, this analysis ... 

Productivity tools for web-based information 
Robin Green 

September 1998 Proceedings of the 16th annual international conference on Computer 
documentation 

Full text available: pdf(836.75 KB) Additional Information: full citation, references, citings, index terms



8 The turn model for adaptive routing 
Christopher J. Glass, Lionel M. Ni 

August 1998 25 years of the international symposia on Computer architecture 
Full text available: pdf(1.08 MB) Additional Information: full citation, references, citings, index terms
Jianying Zhou, Kwok-Yan Lam

October 1998 Proceedings of the 4th annual ACM/IEEE international conference on 
Mobile computing and networking 

Full text available: pdf(864.03 KB) Additional Information: full citation, references, index terms
11 Public-key cryptography and password protocols
Shai Halevi, Hugo Krawczyk 

November 1998 Proceedings of the 5th ACM conference on Computer and
communications security

Full text available: pdf(1.28 MB) Additional Information: full citation, references, citings, index terms
13 On-line journal: a tool for enhancing student journals
Robert Riser, Donald Gotterbarn 

August 1998 ACM SIGCSE Bulletin , Proceedings of the 6th annual conference on the 
teaching of computing and the 3rd annual conference on Integrating 
technology into computer science education: Changing the delivery of 
computer science education, volume 30 issue 3 

Full text available: pdf(392.08 KB) Additional Information: full citation, abstract, references, index terms

This paper discusses the development of a web-based on-line journal to replace a 
traditional project journal in a writing intensive undergraduate software engineering course. 
The on-line journal allows students to conveniently maintain their project journals while 
allowing the instructor to more effectively review student journals and provide timely 
feedback. 

Keywords: journal, software engineering 
March 1998 ACM SIGCSE Bulletin, Proceedings of the twenty-ninth SIGCSE technical
On-line programming tests and examinations were administered to approximately 120 first
year computer science students in order to evaluate their practical skills. We describe our 
motivation for on-line testing, outline the technical details of our closed testing 
environment, and present our observations about student performance. We also compare 
the effectiveness of on-line tests versus conventional tests, report the problems we 
encountered and our solutions, relate student opinion regarding th ... 
Multicast routing enables efficient data distribution to multiple recipients. However, existing
work has concentrated on extending single-domain techniques to wide-area networks, 
rather than providing mechanisms to realize inter-domain multicast on a global scale in the 
This paper presents data on the performance of the NAS SP2 IBM system using RS6000
hardware monitors and a performance measurement tool. The data collected showed that 
the SP2 averages about 1.3 Gflops, about 3% of peak. The report provides the relative 
usage for the various hardware units over the entire workload measured over a 9-month 
period. The workload displays moderate parallelism, with the most popular choice of nodes 
as 16. Although the monitor data provide a good snapshot of workload p ... 
Virtual memory is a staple in modern systems, though there is little agreement on how its
functionality is to be implemented on either the hardware or software side of the interface. 
The myriad of design choices and incompatible hardware mechanisms suggests potential 
performance problems, especially since increasing numbers of systems (even embedded 
systems) are using memory management. A comparative study of the implementation 
choices in virtual memory should therefore aid system-level designers ... 